Encoding multiword information by wordwise interleaving


Abstract 

Multiword information is based on multibit symbols disposed in relative contiguity with respect to a 
medium, and is encoded with a wordwise interleaving and wordwise error protection code for 
providing error locative clues across multiword groups. In particular, the clues originate in high 
protectivity clue words (BIS) that are interleaved among clue columns, and also in synchronizing 
columns constituted from synchronizing bit groups. The synchronizing columns are located where the 
clue columns are relatively scarcer disposed. The clues are directed to low protectivity target words
(LDS) that are interleaved in a substantially uniform manner among target columns which form 
uniform-sized column groups between periodic arrangements of clue columns and synchronizing 
columns.

Claims


What is claimed is: 

1. A method for encoding multiword information that is based on multibit symbols disposed in
relative contiguity with respect to a medium, whilst providing wordwise interleaving and wordwise 
error protection code facilities, for so providing error locative clues across multiword groups, 

characterized by originating such clues both in high protectivity clue words (BIS) that are interleaved 
among clue columns and also in synchronizing columns constituted from synchronizing bit groups 
and locating said synchronizing columns where said clue columns are relatively scarcer disposed, said
clues being directed to low protectivity target words (LDS) that are interleaved in a substantially
uniform manner among target columns which form uniform-sized column groups between periodic
arrangements of clue columns and synchronizing columns. 

2. A method as claimed in claim 1, and arranging said information into uniform-sized physical 
clusters that each have a single synchronizing column and an odd number of clue columns. 

3. A method as claimed in claim 1, whilst assigning user data exclusively to said target columns and 
assigning system data at least predominantly to said clue columns.

4. A method as claimed in claim 1, wherein at a lowest level a multi-symbol data frame of target 
symbols contains a multi-symbol but bit-organized error detection bit group EDC. 

5. A method as claimed in claim 4, wherein at a next higher level an ECC sector contains a plurality 
of data frames for distribution over a plurality of target words through addition of Reed-Solomon 
redundancy. 

6. A method as claimed in claim 5, while before said interleaving sequentially separating various code
word blocks as a speeding up measure with respect to later decoding (FIG. 11).

7. A method as claimed in claim 1, whilst superposing said interleaving on row-wise incrementally
rotating of target symbols within their cluster.

8. A method as claimed in claim 1 applied to storage on an optical medium. 

9. A method as claimed in claim 1, wherein all clue words and target words carry a uniform amount
of redundancy, but target words have more data symbols than clue words. 

10. A method as claimed in claim 1, wherein the clue words of a comprehensive Physical Storage
Cluster number a multiple (24/3) of the number of clue columns in the Physical Cluster; 
like-numbered symbols of the clue words being distributed into groups of as many symbols as there 
are clue columns in a Physical Cluster over like-ranked Recording Frames of different Physical 
Sectors of the Physical Cluster, in a staggered but otherwise uniform interleaving schedule among the 
various Recording Frames. 

11. A method as claimed in claim 10, wherein even-numbered clue symbol rows are assigned to a first
contiguous group of half of the Physical Sectors, and odd-numbered clue symbol rows are assigned to 

12. A method as claimed in claim 10, wherein the rows of clue column symbols are subjected to a 
staggered and systematic rotating among the various clue columns.

13. A method as claimed in claim 10, whilst assigning to clue words both logical and physical address 
data pertaining to the actual Physical Cluster. 

14. A unitary storage carrier produced by a method as claimed in claim 1, and storing multiword 
information that is based on multibit symbols disposed in relative contiguity thereon, with wordwise 
interleaving and wordwise error protection code facilities that provide error locative clues across 
multiword groups, 

characterized by originating such clues both in high protectivity clue words that are interleaved 
among clue columns and also in synchronizing columns constituted from synchronizing bit groups
and locating said synchronizing columns where said clue columns are relatively scarcer disposed, all
clues being directed to low protectivity target words that are interleaved in a substantially uniform
manner among target columns which form uniform-sized column groups between periodic
arrangements of clue columns and synchronizing columns. 

15. A carrier as claimed in claim 14, having arranged said information into uniform-sized physical 
clusters that each have a single synchronizing column and an odd number of clue columns. 

16. A carrier as claimed in claim 14, having user data exclusively contained in target columns and
system data at least substantially exclusively contained in clue columns. 

17. A carrier as claimed in claim 14, wherein at a lowest level a multi-symbol data frame of target 
symbols also contains a multi-symbol but bit-organized error detection bit group. 

18. A carrier as claimed in claim 17, wherein at a next higher level a data sector contains a plurality of 
data frames distributed over a plurality of target words through added Reed-Solomon redundancy. 

19. A carrier as claimed in claim 18, having superposed said interleaving on row-wise incrementally 
rotation of target symbols within their cluster. 

20. A carrier as claimed in claim 14, and based on an optical storage medium. 

21. A carrier as claimed in claim 14, wherein all clue words and target words carry a uniform amount 




United States Patent: 6,367,049 



uniform amount of redundancy therein, whilst allowing in target words for more data symbols than in
clue words. 

34. A method as claimed in claim 26, wherein the clue words of a comprehensive Physical Storage
Cluster number a multiple (24/3) of the number of clue columns in the Physical Cluster;
like-numbered symbols of the clue words being derived from groups of as many symbols as there are
clue columns in a Physical Cluster over like-ranked Recording Frames of different Physical Sectors of 
the Physical Cluster, in a staggered but otherwise uniform deinterleaving schedule among the various 
Recording Frames. 

35. A method as claimed in claim 34, wherein even-numbered symbol rows are derived from a first 
contiguous group of half of the Physical Sectors, and odd-numbered symbol rows are derived from 

36. A method as claimed in claim 34, wherein the rows of clue column symbols are derived from 
staggered and systematically rotated symbols along the various clue columns. 

37. A method as claimed in claim 34, whilst deriving from clue words both logical address data and 
physical address data pertaining to the actual Physical Cluster. 

38. A device for encoding multiword information that is based on multibit symbols disposed in 
relative contiguity with respect to a medium, through providing wordwise interleaving means and 
wordwise error protection encoding means, for so providing error locative clues across multiword 
groups, 

characterized by said encoding means directing such clues both to high protectivity clue words that 
are interleaved among clue columns and also to synchronizing columns constituted from
synchronizing bit groups through locating means for locating said synchronizing columns where said
clue columns are relatively scarcer disposed, all clues being directed to low protectivity target words
that are interleaved in a substantially uniform manner among target columns which form
uniform-sized column groups between periodic arrangements of clue columns and synchronizing
columns. 

39. A device as claimed in claim 38, and having arranging means for arranging said information into 
uniform-sized physical clusters that each have a single synchronizing column and an odd number of
clue columns.

40. A device as claimed in claim 38, being arranged for assigning user data exclusively to said target 
columns and system data at least substantially exclusively to said clue columns.

41. A device as claimed in claim 38, having generator means for at a lowest level in a multi-symbol 
data frame of target symbols also generating a multi-symbol but bit-organized error detection bit
group. 

42. A device as claimed in claim 41, arranged for at a next higher level forming a data sector to 
contain a plurality of data frames for distribution over a plurality of target words through added
Reed-Solomon redundancy. 

43. A device as claimed in claim 38, and having superposing means for superposing said interleaving
on row-wise incrementally rotating of target symbols within their cluster.

44. A device as claimed in claim 38, and having interfacing means for interfacing to an optical storage
medium. 

45. A device as claimed in claim 38, wherein said encoding means are arranged for assigning to
clue words and target words a uniform amount of redundancy, but to target words a larger
amount of data symbols than to clue words.

46. A device as claimed in claim 38, wherein the clue words of a comprehensive Physical Storage 
Cluster number a multiple (24/3) of the number of clue columns in a Physical Cluster; said device
having distributing means for distributing like-numbered symbols of the clue words in groups of as
many symbols as there are clue columns in a Physical Cluster, over like-ranked Recording Frames of
different Physical Sectors of the Physical Cluster, in a staggered but otherwise uniform interleaving 
schedule among the various Recording Frames. 

47. A device as claimed in claim 38, having assigning means for assigning even-numbered symbol 
rows to a first contiguous group of half of the Physical Sectors, and odd-numbered symbol rows to the 
second contiguous group of half of the Physical Sectors. 

48. A device as claimed in claim 38, and having rotating means for subjecting the rows of clue
column symbols to a staggered and systematic rotating among the various clue columns. 

49. A device as claimed in claim 38, and having address assigning means for assigning to clue words 
both logical and physical address means pertaining to the actual Physical Cluster. 

50. A device for decoding multiword information that is based on multibit symbols disposed in 
relative contiguity with respect to a medium, through effecting wordwise deinterleaving and
wordwise error code protecting facilities, and so providing error locative clues across multiword 
groups, 

characterized by being arranged for deriving such clues both from high protectivity clue words that
are interleaved among clue columns and also from synchronizing columns constituted from
synchronizing bit groups, through accessing means for accessing said synchronizing columns where
said clue columns are relatively scarcer disposed, all clues being directed to low protectivity target
words that are interleaved in a substantially uniform manner among target columns which form
uniform-sized column groups between periodic arrangements of clue columns and synchronizing
columns.

51. A device as claimed in claim 50, and having accessing means for accessing said information
according to uniform-sized physical clusters that each have a single synchronizing column and an odd 
number of clue columns. 

52. A device as claimed in claim 50, being arranged for deriving user data exclusively from target 
columns and system data at least substantially exclusively from clue columns. 

53. A device as claimed in claim 50, and having accessing means for accessing at a lowest level 
within a data frame a multi-symbol but bit-organized error detection bit group, for therefrom deriving
an error detecting signal. 

54. A device as claimed in claim 53, wherein said accessing means are arranged for at a next higher
level accessing within a data sector a plurality of respective data frames distributed over a plurality of 
target words through evaluating added Reed-Solomon redundancy. 

55. A device as claimed in claim 50, having corrector means for superposing on said interleaving a
row-wise incrementally backrotating of target symbols within their cluster. 

56. A device as claimed in claim 50, and having interface means for interfacing to an optical storage 
medium. 

57. A device as claimed in claim 50, said decoding means being arranged for decoding all clue words
and target words through a uniform amount of redundancy, but for deriving from target words a
larger amount of data symbols than from clue words. 

58. A device as claimed in claim 50, wherein the clue words of a comprehensive Physical Storage 
Cluster number a multiple (24/3) of the number of clue columns in the Physical Cluster; and being 
arranged for deriving like-numbered symbols of the clue words as being distributed into groups of as
many symbols as there are clue columns in a Physical Cluster from like-ranked Recording Frames of
different Physical Sectors of the Physical Cluster, in a staggered but otherwise uniform interleaving
schedule among the various Recording Frames. 

59. A device as claimed in claim 58, and being arranged for deriving even-numbered symbol
rows from a first contiguous group of half of the Physical Sectors, and odd-numbered symbol rows 
from the second contiguous group of half of the Physical Sectors. 

60. A device as claimed in claim 58, being arranged for deriving the rows of clue column symbols 
through staggered and systematic backrotating over the various clue columns.

61. A device as claimed in claim 50, being arranged for deriving both logical and physical address 
data from the clue columns as pertaining to the actual Physical Cluster. 

Description 


BACKGROUND OF THE INVENTION

The invention relates to a method according to the preamble of claim 1. U.S. Pat. No. 4,559,625 to
Berlekamp et al, and U.S. Pat. No. 5,299,208 to Blaum et al disclose decoding of interleaved and
error protected information words, wherein an error pattern found in a first word may give a clue to
locate errors in another word of the same group of words. The references use a fault model with
multisymbol error bursts across plural words. An error in a particular word yields a high probability
for an error to occur at a corresponding symbol position pointed at in a next word or words. The
procedure may raise the number of corrected errors. A clue will however only materialize when the
clue word has been fully corrected. Furthermore, the medium stores information as well as
synchronization bit groups which represent an appreciable amount of redundancy that may also be
used to signal errors in the target words, so that to some extent clue words may be exchanged for
synchronization bit groups. Part of the errors is due to so-called bit slips that are more frequent further
away from the sync bit groups. Therefore, interleaving the clue words amongst clue columns and also
interleaving the target words amongst target columns may further improve the error protection.

SUMMARY TO THE INVENTION 

In consequence, amongst other things, it is an object of the present invention to provide a coding 
format allowing clue words to cooperate with synchronizing bit groups in a systematic format, while
also smoothing out possibly negative effects of bit slips more evenly among the various words. Now
therefore, according to one of its aspects the invention is characterized as recited in the characterizing
part of claim 1. A clue can point to an erasure symbol. Pointing can make error correction more
powerful. In fact, many codes will correct at most t errors when no error locative indication is known.
Given the erasure locations, generally a larger number e > t of erasures may be corrected. Also
protection against a combination of bursts and random errors may improve. The invention may be 
used both for storage and for transmission. 

Advantageously, the method has the features of claim 2. This is a relatively simple arrangement. 
Alternatively, the number of synchronizing columns may be larger than one, and the number of clue
columns may be even if preferred. 

Advantageously, the method has the features of claim 3. In video recording, user data will relate to 
the image and accompanying sound to be presented to a user, whereas system data may indicate the 
program name, time, addresses, and various other parameters that may be useful without reference to 
the video or audio proper. This feature allows fast access to the system data without necessity for 
decoding the target words. 

Advantageously, the method has the features of claim 4. If error-free, this feature will immediately 
indicate to a user device whether the processed frame of information is correct.

Advantageously, the method has the features of claim 5. This is a straightforward organization.
Advantageously, the method has the features of claim 6. This has proven to raise the worst case to the 
average level attainable for this code format. Advantageously, the method has the features of claim 7. 
Optical storage has been proven a fortuitous medium. 

The invention also relates to a method for decoding information so encoded, to an encoding and/or 
decoding device for use with the method, and to a carrier containing information so encoded. Further
advantageous aspects of the invention are recited in dependent claims.

BRIEF DESCRIPTION OF THE DRAWING 

These and further aspects and advantages of the invention will be discussed more in detail hereinafter
with reference to the disclosure of preferred embodiments, and in particular with reference to the
appended FIGURES that show:

FIGS. 1a, 1b, a record carrier;
FIG. 2, a playback device;
FIG. 3, a recording device;
FIG. 4, a system with encoder, carrier, and decoder;
FIG. 5, a code format principle;
FIG. 6, a schematic representation of a physical cluster on the carrier;
FIG. 7, a data frame;
FIG. 9, renumbering data bytes and forming an ECC sector by adding parities;
FIG. 10, a composition of an ECC cluster by multiplexing 16 ECC sectors;
FIG. 11, a renumbered ECC cluster before interleaving;
FIG. 12, an interleaved ECC cluster;
FIG. 13, multiplexing of the BIS cluster with the interleaved ECC cluster;
FIG. 14, a BIS block containing 24 BIS code words;
FIG. 15, the frame structure for explaining main data interleaving;
FIG. 16, the interleaving proper;
FIG. 17, an example of the partial mapping of BIS bytes on the first eight sectors;
FIG. 18, an example of the partial mapping of BIS bytes on the last eight sectors;
FIG. 19, a schematic representation of the overall encoding process.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 

FIG. 1a shows a disc-shaped record carrier 11 with track 19 and central hole 10. Track 19 is arranged
in a spiral pattern of turns forming substantially parallel tracks on an information layer. The carrier
may be an optical disc with a recordable or prerecorded information layer. CD-R, CD-RW, and
DVD-RAM are recordable, whereas audio-CD is an example of a prerecorded disc. The prerecorded
type can be manufactured in a known way by first recording a master disc and then pressing consumer
discs. On a recordable carrier the track is indicated by a pre-embossed track structure provided during
manufacture of the blank carrier. The track may be structured as a pregroove 14 to enable a read/write
head to follow the track 19 during scanning. Information is represented on the information layer by
optically detectable marks, e.g. pits and lands.

FIG. 1b is a cross-section along line b-b of the record carrier 11 of the recordable type, transparent
substrate 15 being provided with recording layer 16 and protective layer 17. Pregroove 14 may be 
implemented as an indentation or an elevation, or as a material property deviating from its 
surroundings. 

The record carrier may carry user information, that for user convenience has been sub-divided into 
smaller items, that each may have a duration of a few minutes e.g. songs of an album, or movements 
of a symphony. Access information for identifying the items can also be provided on the carrier, e.g. a 
so-called Table Of Contents (TOC) or a file system like ISO 9660 for CD-ROM. The access 
information may include playing time and start address for each item, and further information like a 
song title. Such information may represent system information. The information is recorded in a 
digital manner after analog to digital (A/D) conversion. 

FIG. 2 shows a playback apparatus according to the invention for reading a record carrier 11 such as
shown in FIG. 1. The device has a drive 21 for rotating the record carrier 11, and read head 22 for
scanning the track on the record carrier. The apparatus has positioning means 25 for coarse radial
positioning of the read head 22. The read head comprises an optical system of a known type for
generating a radiation beam 24 guided through optical elements focussed to a radiation spot 23 on a
track of the information layer. The radiation beam 24 is generated by a known radiation source. The
read head further has a focusing actuator for moving the focus of the radiation beam 24 along the
optical axis of the beam and a tracking actuator for fine positioning of the spot 23 in a radial direction
on the track centre. The tracking actuator may comprise coils for radially moving an optical element
or may be arranged for changing the angle of a reflecting element. The radiation reflected by the
information layer is detected by a detector of a usual type, e.g. a four-quadrant diode, in the read head
22 for generating a read signal and further detector signals including a tracking error and a focusing
error signal coupled to said tracking and focusing actuators. The read signal is processed by reading
means 27 to retrieve the data, which reading means are of a usual type for example comprising a
channel decoder and an error corrector. The retrieved data is sent to data selection facility 28, which
selects certain information from the data read and sends it to buffer 29. The selection is based on data type
indicators recorded on the record carrier, e.g. headers in a framed format. The compressed
information is sent from buffer 29 to de-compressor 31 via signal 30. This signal may also be
externally available. The de-compressor 31 decodes the data to reproduce the original information on
output 32. The de-compressor may be fitted separately, as indicated by the rectangle 33 in FIG. 2.
Alternatively, the buffer may be positioned before the data selection. Control unit 20 furthermore
receives control commands from a user or from a host computer via control lines 26 such as a system
bus, connecting drive 21, positioning means 25, reading means 27 and data selection means 28, and
possibly also buffer 29 for buffer filling level control. Thereto, control unit 20 comprises control
circuitry, such as a microprocessor, a program memory and control gates, or a state machine.

Compression and de-compression are well-known. During de-compression an inverse process is 
applied to reconstruct the original signal. If the original digitized signal is reconstructed exactly, the 
(de-)compression is lossless, but in lossy (de-)compression some details of the original signal will not
be reproduced. Such omitted details are substantially undetectable by the human ear or eye. Most
known systems, such as MPEG, use lossy compression for audio and video; lossless compression is
used for storing computer data.

The data selection 28 is arranged for retrieving control information from the data read, and for 
discarding any stuffing data that had been added during recording. The rotation rate may be adjusted 
using the average filling level of the buffer 29, e.g. decreasing the rotation rate when the buffer is 
more than 50% full on average.

FIG. 3 shows a recording device for writing information on a (re)writable record carrier 11. During a
writing operation, marks representing the information are formed on the record carrier. The marks
may be in any optically readable form, e.g. in the form of areas whose reflection coefficient differs
from their surroundings, through recording in materials such as dye, alloy or phase change, or in the
form of areas with a direction of magnetization different from their surroundings, when recording in
magneto-optical material. Writing and reading of information for recording on optical disks and
usable formatting, error correcting and channel coding rules are well-known in the art, e.g. from the
CD system. The marks can be formed through a spot 23 generated on the recording layer via an
electromagnetic radiation beam 24 from a laser diode. The recording device comprises similar basic
elements as the apparatus for reading described above in FIG. 2, i.e. control unit 20, drive means 21
and positioning means 25, but it has a write head 39. The information is presented on the input of
compression means 35, that may be placed in a separate housing. The variable bitrate compressed
information on the output of the compression means 35 is passed to buffer 36. From the buffer 36 the
data is passed to data combination means 37 for adding stuffing data and further control data. The
total data stream to be recorded is passed to writing means 38. The write head 39 is coupled to the
writing means 38, which comprise a formatter, an error encoder and a channel modulator. The data
presented to the input of the writing means 38 is distributed over logical and physical sectors
according to formatting and encoding rules discussed infra, and converted into a write signal for write
head 39. Unit 20 is arranged to control the buffer 36, the data combination means 37 and the writing
means 38 via control lines 26 and for performing the positioning procedure as described above for the
reading apparatus. The recording apparatus may be arranged for reading through the features of a
playback apparatus and a combined write/read head.

FIG. 4 shows a comprehensive system according to the invention, provided with an encoder, a carrier, 
and a decoder. The embodiment is used for encoding, storing, and finally decoding a sequence of 
samples or multibit symbols derived from an audio or video signal, or from data. Terminal 120 
receives a stream that may have eight bit symbols. Splitter 122 recurrently and cyclically transfers 
first symbols intended for the clue words to encoder 124, and all other symbols to encoder 126. In 
encoder 124 the clue words are formed by encoding the data into code words of a first multi-symbol 
error correcting code. This code may be a Reed-Solomon code, a product code, an interleaved code, 
or a combination. In encoder 126 the target words are formed by encoding into code words of a
second multi-symbol error correcting code. In FIG. 5, all code words have a uniform length, but this 
is not a restriction. The clue words may have a much higher degree of error protection. This may be 
effected by a larger number of check symbols, by a smaller number of data symbols, or by a 
combination thereof. 
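
The cyclic routing performed by splitter 122 can be sketched as follows. This is a minimal illustration only: the text does not state how many symbols form one cycle or how many of them go to the clue words, so the parameters `period` and `clue_per_period` and the function name `split_stream` are hypothetical.

```python
from typing import Iterable, List, Tuple


def split_stream(symbols: Iterable[int], period: int,
                 clue_per_period: int) -> Tuple[List[int], List[int]]:
    """Route the first `clue_per_period` symbols of every cycle of `period`
    symbols to the clue encoder, and all other symbols to the target encoder.

    Both parameters are assumptions made for illustration; the source only
    says the splitter works "recurrently and cyclically".
    """
    if not 0 < clue_per_period < period:
        raise ValueError("need 0 < clue_per_period < period")
    clue: List[int] = []    # would feed encoder 124 (clue words)
    target: List[int] = []  # would feed encoder 126 (target words)
    for index, symbol in enumerate(symbols):
        if index % period < clue_per_period:
            clue.append(symbol)
        else:
            target.append(symbol)
    return clue, target


clue_symbols, target_symbols = split_stream(range(10), period=5, clue_per_period=1)
```

With a cycle of five symbols and one clue symbol per cycle, symbols 0 and 5 go to the clue path and the remaining eight to the target path.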

In block 128, the code words are transferred to one or more outputs of which an arbitrary number has
been indicated, so that the distribution on a medium to be discussed later will become uniform. Block
130 symbolizes the unitary medium such as tape or disc that receives the encoded data. This may
relate to direct writing in a write-mechanism-plus-medium combination. Alternatively, the medium
may be a copy from a master encoded medium such as a stamp. Storage may be optical and fully
serial, but other configurations may be used. In block 132, the various words are read again from the
medium. Then the clue words of the first code will be sent to decoder 134, and decoded as based on
their inherent redundancies. Furthermore, as will become apparent in the discussion of FIG. 5
hereinafter, such decoding may present clues on the locations of errors in other than these clue words.
Moreover, the information from synchronizing bit columns may be analyzed as to interferences
therein, to immediately produce additional clues for the target words. Box 135 receives all clues and
contains a program for using one or more different strategies to translate such clues to erasure
locations. The target words are decoded in decoder 136. With help from the erasure locations, the
error protection of the target words is raised to a higher level. Finally, all decoded words are
demultiplexed by means of element 138 to output 140 conforming to the original format. For brevity,
electromechanical interfaces between the various subsystems were ignored.

FIG. 5 illustrates a simple code format. The coded information has been shown arranged in a 480
symbol block with 15 horizontal rows and 32 vertical rows. Storage on a medium starts at the top left
and proceeds along vertical rows. The hatched region contains check symbols: horizontal rows 4, 8,
and 12 have 8 check symbols each and constitute clue words. The other rows contain 4 check symbols
each and constitute target words. The whole block has 408 information symbols and 72 check
symbols. The latter may be localized in a more distributed manner over respective words. Further to
the above, the top horizontal row contains a representation of synchronizing bit groups. These are
present on the medium to synchronize a reader device to the format, but generally contain neither
system data nor user data and have a prescribed format with much redundancy. Therefore, it is often
easy to detect an interference, and the occurrence of a single one or a plurality of disturbed
synchronizing bit groups that are physically close to each other or to distributed clue symbols can be
used to signal occurrence of a burst error. This will produce clues in similar manner as the clue words.
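
The symbol budget of the FIG. 5 block can be checked with a few lines of arithmetic. The sketch below uses only the figures stated above (15 words of 32 symbols, clue words in rows 4, 8 and 12 with 8 check symbols, 4 check symbols elsewhere); the constant and function names are chosen here for illustration.

```python
WORDS = 15                 # horizontal rows, one code word each
SYMBOLS_PER_WORD = 32      # vertical rows (columns)
CLUE_ROWS = (4, 8, 12)     # 1-based row numbers of the clue words
CLUE_CHECK = 8             # check symbols per clue word
TARGET_CHECK = 4           # check symbols per target word


def block_budget() -> tuple:
    """Return (total, check, information) symbol counts for the block."""
    total = WORDS * SYMBOLS_PER_WORD
    check = sum(CLUE_CHECK if row in CLUE_ROWS else TARGET_CHECK
                for row in range(1, WORDS + 1))
    return total, check, total - check


total, check, information = block_budget()
print(total, check, information)  # 480 72 408
```

Three clue words contribute 24 check symbols and twelve target words contribute 48, which gives the 72 check symbols and 408 information symbols quoted in the text.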

The Reed-Solomon code allows correcting in each clue word up to four symbol errors. Actual symbol
errors have been indicated by crosses. In consequence, all clue words may be decoded correctly,
inasmuch as they do not have more than four errors. Notably words 2 and 3 may however not be
decoded on the basis of their own redundancy alone. In the Figure, all errors, except 62, 56, 68
represent error strings. Only strings 52 and 58 cross at least three consecutive clue words and are
considered as error bursts, causing at least all intermediate symbol locations to get an erasure flag.
Also, target words just before the first clue word error of the burst and target words just after the last
clue word error of the burst may get an erasure flag at that location, depending on the strategy
followed. String 54 is not considered a burst, because it is too short.
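
One possible reading of this burst rule can be sketched as follows. The sketch assumes the FIG. 5 geometry (15 rows, clue words in rows 4, 8 and 12, storage proceeding down each vertical row), numbers symbols by a storage-order index, and flags only the target symbols lying between the first and last erroneous clue symbol of a burst. The optional flagging just before and just after the burst, and the bridging of accidentally correct clue symbols, are left out. All names are hypothetical.

```python
from typing import List, Optional, Set

ROWS = 15               # symbols per vertical row, as in FIG. 5
CLUE_ROWS = (4, 8, 12)  # 1-based rows holding clue words
MIN_BURST = 3           # consecutive erroneous clue symbols that make a burst


def clue_positions(columns: int) -> List[int]:
    """Storage-order indices of all clue symbols, column by column."""
    return [col * ROWS + (row - 1)
            for col in range(columns) for row in CLUE_ROWS]


def erasure_flags(clue_errors: Set[int], columns: int) -> Set[int]:
    """Return storage-order indices of target symbols to flag as erasures.

    `clue_errors` holds the indices of clue symbols found erroneous after
    the clue words were decoded.
    """
    clues = clue_positions(columns)
    clue_set = set(clues)
    flagged: Set[int] = set()
    run: List[int] = []
    sequence: List[Optional[int]] = [*clues, None]  # None closes the last run
    for pos in sequence:
        if pos is not None and pos in clue_errors:
            run.append(pos)
            continue
        if len(run) >= MIN_BURST:
            # Flag every target symbol between the first and last clue error.
            flagged.update(p for p in range(run[0], run[-1] + 1)
                           if p not in clue_set)
        run = []
    return flagged


# Three consecutive clue errors in the first column form a burst.
burst = erasure_flags({3, 7, 11}, columns=2)
# Two consecutive clue errors are too short to count as a burst.
short = erasure_flags({7, 11}, columns=2)
```

Because the index follows storage order, a run of clue errors that continues from the bottom of one vertical row into the top of the next is still treated as one burst.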

As a consequence, two of the errors in word 4 produce an erasure flag in the associated vertical rows. 
This renders words 2 and 3 correctable, each with one error symbol and two erasure symbols. 
However, neither random errors 62, 68 nor string 54 constitute clues for words 5, 6, 7, because each of 
them contains only one clue word. Sometimes, an erasure causes a zero error pattern, because an 
arbitrary error in an 8-bit symbol has a 1/256 probability to cause again a correct symbol. Likewise, a 
long burst crossing a particular clue word may produce a correct symbol therein. By a bridging strategy 
between preceding and succeeding clue symbols of the same burst, this correct symbol is then 
incorporated into the burst, and in the same manner as erroneous clue symbols translated into erasure 
values for appropriate target symbols. The above decisions may vary according to decoding policy. 
The clues by the sync bit groups may be used in similar manner as those from the clue words. 
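
The burst rule described above may be sketched as follows. This is a minimal illustration under stated assumptions, not the decoder of the invention: symbols are modeled as one linear sequence in recording order, the names `erasure_flags`, `clue_positions` and `clue_errors` are hypothetical, a run of at least three consecutive erroneous clue symbols is treated as a burst, and neither the bridging strategy nor the flagging of locations just outside the burst is modeled.

```python
def erasure_flags(clue_positions, clue_errors, min_run=3):
    """Return the target-symbol positions that receive an erasure flag.

    clue_positions: positions of clue symbols in recording order.
    clue_errors: set of clue positions found erroneous by the clue decoder.
    min_run: number of consecutive erroneous clue symbols counted as a burst
             (three in the text; other policies are possible).
    """
    clue_set = set(clue_positions)
    flagged = set()
    run = []  # current run of consecutive erroneous clue positions
    for pos in sorted(clue_set) + [None]:  # None flushes the final run
        if pos is not None and pos in clue_errors:
            run.append(pos)
            continue
        if len(run) >= min_run:
            # Flag every non-clue symbol between the first and last clue error.
            flagged.update(p for p in range(run[0], run[-1] + 1)
                           if p not in clue_set)
        run = []
    return flagged


clues = [3, 7, 11, 15, 19]                 # hypothetical clue symbol positions
burst = erasure_flags(clues, {7, 11, 15})  # three consecutive clue errors
isolated = erasure_flags(clues, {3, 15})   # two unrelated clue errors
```

With these inputs the three consecutive clue errors flag the six target positions lying between them, whereas the two isolated clue errors produce no flags, in line with the treatment of short strings and random errors above.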

The relevance of the present invention is brought about by newer methods for digital optical storage. 
A particular feature is that in the case of substrate incident reading the upper transmissive layer is as 
thin as 100 micron. The channel bits have a size of some 0.14 micron, so that a data byte at channel 
rate of 2/3 will have a length of only 1.7 microns. At the top surface the beam has a diameter of some 
125 microns. A caddy or envelope for the disc will reduce the probability of large bursts. 
Non-conforming particles of less than 50 microns may cause short faults, although the invention is 
also useful against longer faults. A fault model has been used wherein 50 micron faults through error 
propagation may lead to bursts of 200 microns, corresponding to some 120 Bytes. A particular model 
has fixed size bursts of 120B that start randomly with a probability per byte of 2.6*10.sup.-5, or on 
the average one burst per 32 kB block. The invention has been conceived for optical disc storage, but 
other configurations such as multitrack tape, and other technologies such as magnetic and 
magneto-optical would also benefit from the improved approach. 
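
The figures quoted in this fault model can be checked with a few lines of arithmetic. The sketch below only reproduces the numbers given in the text; the variable names are illustrative, and 32 kB is taken as 32*1024 bytes, which is an assumption.

```python
channel_bit_um = 0.14          # channel bit length in microns (from the text)
code_rate = 2 / 3              # data bits per channel bit

# One data byte occupies 8 / (2/3) = 12 channel bits.
byte_len_um = 8 / code_rate * channel_bit_um

# A 200 micron burst expressed in data bytes.
burst_bytes = 200 / byte_len_um

# Expected number of burst starts in a 32 kB block.
p_burst_per_byte = 2.6e-5
bursts_per_block = p_burst_per_byte * 32 * 1024
```

The byte length comes out at about 1.68 microns and the burst at about 119 bytes, matching the rounded values of 1.7 microns and 120 Bytes. The expected number of bursts per block is about 0.85, that is, of the order of one burst per block.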

Discussion of a Preferred Format of the Information 

Before recording, User Data received from a source, that may be an application or a host, is formatted 
in a number of successive steps, that will be explained more in detail with reference to FIG. 19, as 
follows: Data Frames, Data Sectors, ECC Sectors, an ECC Cluster, a BIS Cluster, a Physical Cluster 
and Recording Frames. 

The data is recorded in 64 k partitions, called Physical Clusters, that each contain 32 Data Frames 
with 2048 bytes of User Data. A Physical Cluster is protected by two error correction mechanisms: 

firstly, a Long Distance (LDS) (248,216,33) Reed-Solomon (RS) error correction code; 

secondly, the data is multiplexed with a Burst Indicator Subcode (BIS), that consists of (62,30,33) 
Reed-Solomon (RS) codewords. The number of parity symbols is equal for the two code categories, 
which allows the same decoder hardware to be used for both cases. The BIS code is used to indicate long 
burst errors, by which the LDS code can more efficiently perform erasure corrections. All data is 
arranged in an array as shown in FIG. 6. Note that the horizontal and vertical directions have been 
interchanged with respect to FIG. 5. The array is read along a horizontal direction, and is recorded on 
the disc after inserting synchronization patterns and additional d.c. control bits, and after modulation. 
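
The benefit of erasure information can be illustrated with the standard property of a Reed-Solomon code of minimum distance d: any pattern of e errors and f erasures is correctable if 2e + f < d, where d equals the number of parity symbols plus one. The helper below is a sketch of that rule only; the name `correctable` is hypothetical and no decoding is performed.

```python
def correctable(parity, errors, erasures):
    """True if a Reed-Solomon word with `parity` check symbols can correct
    the given number of symbol errors and flagged erasures.

    The minimum distance is d = parity + 1, and the condition is
    2 * errors + erasures < d.
    """
    return 2 * errors + erasures <= parity


# LDS and BIS words both carry 32 parity symbols (d = 33).
lds_errors_only = correctable(32, 16, 0)    # up to 16 unflagged errors
lds_too_many = correctable(32, 17, 0)       # 17 unflagged errors fail
lds_erasures_only = correctable(32, 0, 32)  # up to 32 flagged erasures
```

Each flagged erasure costs one parity symbol instead of two, so a burst that is fully flagged by the BIS clues may be twice as long as an unflagged one. The same rule covers the small example of FIG. 5: a target word with 4 check symbols handles one error plus two erasures, but not three unflagged errors.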

The error correction codes are applied in vertical direction, which gives a good basic break-up of 
burst errors on the disc. Additionally, the LDS codewords have been interleaved in a diagonal 
direction. For the purpose of addressing, a whole Physical Cluster is subdivided into 16 Physical 
Sectors, each consisting of 32 consecutive rows. 

A Data Frame consists of 2052 bytes: 2048 User Data bytes numbered d.sub.0 to d.sub.2047 and 4 
Error Detection Code (EDC) bytes numbered e.sub.2048 to e.sub.2051. The bytes e.sub.2048 to 
e.sub.2051 contain an Error Detection Code computed over 2048 bytes of the Data Frame. The Data 
Frame is considered as a field of single bits, starting with the most significant bit of the first User 
Data byte d.sub.0 and ending with the least significant bit of the last EDC byte e.sub.2051. The msb is 
b.sub.16415 and the lsb is b.sub.0. Each bit b.sub.i of the EDC is shown as follows for i=0 to 31: 
##EQU1## 

where: ##EQU2## 

Next, two Data Frames (A, B) are placed in an array of 19 columns by 216 rows, called a Data Sector. 
The filling of the array is done column by column, starting at the top of the first one with byte 
d.sub.0,A and ending at the bottom of the last column with byte e.sub.2051,B, cf. FIG. 8. 
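
The column-by-column filling may be sketched as below. The function name `build_data_sector` and the row-major list-of-lists layout are illustrative assumptions; the sketch further assumes that Data Frame B follows Data Frame A without any gap, so that B starts halfway down the tenth column (2052 = 9*216 + 108). FIG. 8 remains the authoritative layout.

```python
ROWS, COLS, FRAME_LEN = 216, 19, 2052


def build_data_sector(frame_a, frame_b):
    """Place two 2052-byte Data Frames into a 216-row by 19-column array.

    Returns sector[row][col]. The byte stream A followed by B fills
    column 0 from top to bottom, then column 1, and so on.
    """
    if len(frame_a) != FRAME_LEN or len(frame_b) != FRAME_LEN:
        raise ValueError("each Data Frame must hold 2052 bytes")
    stream = bytes(frame_a) + bytes(frame_b)      # 4104 bytes = 19 * 216
    return [[stream[col * ROWS + row] for col in range(COLS)]
            for row in range(ROWS)]


# Hypothetical frame contents, used only to make the positions visible.
frame_a = bytes(i % 251 for i in range(FRAME_LEN))
frame_b = bytes((7 * i + 1) % 253 for i in range(FRAME_LEN))
sector = build_data_sector(frame_a, frame_b)
```

Under these assumptions `sector[0][0]` holds d.sub.0,A and `sector[215][18]` holds e.sub.2051,B, as stated in the text.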

Next in FIG. 9, the bytes in each column of the Data Sector are renumbered starting from the top of 
the column as follows: d.sub.L,0, d.sub.L,1 . . . d.sub.L,i . . . to d.sub.L,215, L being the column 
number (0 . . . 18). The ECC Sector is completed by extending each column with 32 Parity bytes of a 
(248,216,33) long distance RS code. The Parity bytes are: P.sub.L,216, P.sub.L,217, . . . P.sub.L,j . . . 
to P.sub.L,247. 

The long distance RS code is defined over the finite field GF(2.sup.8). Non-zero elements of the finite 
field GF(2.sup.8) are generated by a primitive element .alpha. that is a root of the primitive 
polynomial p(X)=X.sup.8 +X.sup.4 +X.sup.3 +X.sup.2 +1. The symbols of GF(2.sup.8) are 
represented by 8-bit bytes, using a polynomial base representation with (.alpha..sup.7, .alpha..sup.6, 
.alpha..sup.5, . . . .alpha..sup.2, .alpha., 1) as a basis. The root .alpha. is represented as 
.alpha.=00000010. Each LDS codeword, represented by the vector lds=(d.sub.L,0 . . . d.sub.L,i . . . 
d.sub.L,215 P.sub.L,216 . . . P.sub.L,j . . . P.sub.L,247), belongs to a Reed-Solomon code over 
GF(2.sup.8), with 216 information bytes and 32 parity bytes. Such codeword can be represented by a 
polynomial lds(x) of degree 247, that may have some coefficients zero, the highest degrees 
corresponding to the information part of the vector (d.sub.L,0 . . . etc.) and the lowest degrees 
corresponding to the parity part (P.sub.L,216 . . . etc.). Now, lds(x) is a multiple of the generator 
polynomial g(x) of the LDS codeword. The generator polynomial is: ##EQU3## 
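
The field definition above can be made concrete with a short table construction. This is a sketch of the field arithmetic only: the names `PRIM_POLY`, `build_tables`, `EXP`, `LOG` and `gf_mul` are illustrative, and the generator polynomial itself is not reproduced because its definition is not legible in this text.

```python
PRIM_POLY = 0x11D  # p(X) = X^8 + X^4 + X^3 + X^2 + 1, as a 9-bit mask


def build_tables():
    """Return (exp, log) tables for GF(2^8) with alpha = 00000010."""
    exp = [0] * 255
    log = {}
    x = 1
    for i in range(255):
        exp[i] = x
        log[x] = i
        x <<= 1                # multiply by alpha
        if x & 0x100:          # degree 8 reached: reduce modulo p(X)
            x ^= PRIM_POLY
    return exp, log


EXP, LOG = build_tables()


def gf_mul(a, b):
    """Multiply two field elements given as 8-bit integers."""
    if a == 0 or b == 0:
        return 0
    return EXP[(LOG[a] + LOG[b]) % 255]
```

Because p(X) is primitive, the 255 powers of .alpha. in `EXP` are all distinct and cover every non-zero byte value, which is what permits each symbol position of a codeword to be associated with a distinct power of .alpha..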

The LDS code is systematic: the 216 information bytes appear unaltered in the highest order positions 
of each codeword. The parity check matrix of code lds is such that: H.sub.LDS *lds.sup.T =0 for all 
LDS codewords lds. 

The second row h.sub.LDS 2 of the parity check matrix H.sub.LDS is given by h.sub.LDS 2 
=(.alpha..sup.247 .alpha..sup.246 . . . .alpha..sup.2 .alpha. 1) and corresponds to the zero .alpha. of the 
generator polynomial g(x), which defines the codeword positions to be used for error locations. 
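
The role of this row can be illustrated as follows: its product with a word equals the word polynomial evaluated at .alpha., which vanishes for every polynomial having .alpha. as a zero. The sketch below is an illustration under assumptions. Only the zero .alpha. is taken from the text, the toy word is built as a multiple of (x + .alpha.) alone rather than of the full g(x), and all function names are hypothetical.

```python
def gf_mul(a, b, poly=0x11D):
    """Multiply in GF(2^8) by shift-and-add, reducing modulo p(X)."""
    r = 0
    while b:
        if b & 1:
            r ^= a
        a <<= 1
        if a & 0x100:
            a ^= poly
        b >>= 1
    return r


ALPHA = 0x02


def syndrome_at_alpha(word):
    """Product of the row (alpha^(n-1) ... alpha 1) with the word.

    word[0] is the highest-degree coefficient; Horner's rule is used.
    """
    s = 0
    for sym in word:
        s = gf_mul(s, ALPHA) ^ sym
    return s


def times_x_plus_alpha(msg):
    """Return msg(x) * (x + alpha); used only to build a toy word."""
    out = list(msg) + [0]                  # msg(x) * x
    for i, m in enumerate(msg):
        out[i + 1] ^= gf_mul(m, ALPHA)     # + msg(x) * alpha
    return out


word = times_x_plus_alpha([0x12, 0x34, 0x56])
clean = syndrome_at_alpha(word)

corrupted = list(word)
corrupted[1] ^= 0x01                       # unit error at the x^2 position
dirty = syndrome_at_alpha(corrupted)
```

The uncorrupted word gives a zero result, while the unit error at the position of degree 2 gives .alpha..sup.2, showing how the power of .alpha. attached to a position serves to locate an error.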

After generating the LDS codewords in the ECC Sectors, 16 consecutive ECC Sectors are combined 
to one ECC Cluster by multiplexing the 16*19 columns of height 248, 2 by 2, including the parities. 
In this way 152 new columns are formed with a height of 496 bytes as shown in FIG. 10. The 
numbering of the bytes is according to d.sub.L,M,N where: 

L=0 ... 18 is the LDS code word number within the ECC Sector 

M=0 . . . 247 is the byte number within the LDS code word 

N=0 ... 15 is the ECC Sector number. 
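
The two-by-two multiplexing may be sketched as below. The exact pairing of columns and the byte order within a pair are fixed by FIG. 10, which is not reproduced here. The sketch therefore assumes, purely for illustration, that adjacent columns are paired and that their bytes alternate; the function name is hypothetical.

```python
def multiplex_two_by_two(columns):
    """Merge columns pairwise into columns of double height.

    Assumption: columns 2c and 2c+1 form new column c, with bytes taken
    alternately from the first and the second column of the pair.
    """
    if len(columns) % 2:
        raise ValueError("an even number of columns is required")
    merged = []
    for c in range(0, len(columns), 2):
        new_col = []
        for a, b in zip(columns[c], columns[c + 1]):
            new_col += [a, b]
        merged.append(new_col)
    return merged


# 16 * 19 = 304 columns of height 248, labelled (column, byte) for clarity.
lds_columns = [[(c, m) for m in range(248)] for c in range(304)]
ecc_cluster_columns = multiplex_two_by_two(lds_columns)
```

Whatever the actual pairing, the dimensions follow the text: 304 columns of height 248 become 152 columns of height 496.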

For further improving the burst error correcting capabilities, extra interleaving is introduced. The 
bytes are renumbered in horizontal direction through all rows of the ECC Cluster, cf. FIG. 11. 
Now all rows of an ECC Cluster are shifted two by two over mod(k*3,152) bytes to the left, starting 
from row 2 onwards, the first row being row 0; k=div(row-number, 2). The bytes that shift out at the 
left side re-enter the array from the right side, cf. FIG. 12. After this process the bytes are renumbered 
once more in horizontal direction through all rows, resulting in the numbering D.sub.0 to 
D.sub.75391 indicated in FIG. 6. The renumbering of the bytes causes non-uniform mapping of 
logical addresses on physical addresses. Consequences thereof will be discussed infra. 
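
The row shift can be written out directly from the formula in the text. The sketch below assumes a cluster stored as a list of 496 rows of 152 entries each; the names `shift_amount` and `shift_rows_left` are illustrative.

```python
N_COLS = 152


def shift_amount(row):
    """Left shift in bytes for a given row: mod(k*3, 152), k = div(row, 2)."""
    k = row // 2
    return (k * 3) % N_COLS


def shift_rows_left(cluster):
    """Cyclically shift every row to the left by its shift amount.

    Bytes that leave the array on the left re-enter on the right.
    """
    shifted = []
    for r, row in enumerate(cluster):
        s = shift_amount(r)
        shifted.append(row[s:] + row[:s])
    return shifted


# Each entry holds its original column index, so the shift is visible.
cluster = [list(range(N_COLS)) for _ in range(496)]
interleaved = shift_rows_left(cluster)
```

Rows 0 and 1 stay in place, rows 2 and 3 move by 3 bytes, rows 4 and 5 by 6 bytes, and so on; the shift wraps modulo 152, so that the last row, row 495, is shifted by 133 bytes.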

After interleaving, the interleaved ECC Cluster is split into 4 groups of 38 columns each. In between 
the 4 groups, 3 columns of one byte width each are inserted. These columns carry the address 
information relating to the data contained in the interleaved ECC Cluster. They consist of (62,30,33) 
RS BIS codewords with 30 bytes of information and 32 bytes of parity. Due to the high error 
protection capabilities and an advanced interleaving scheme, these columns may also offer a reliable 
indication of burst errors. 
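
For one row, the insertion of the BIS columns may be sketched as follows. The function name `insert_bis_columns` and the use of plain Python lists are assumptions made for illustration; synchronization patterns and modulation are not modeled.

```python
def insert_bis_columns(lds_row, bis_bytes, group=38):
    """Return one 155-byte row: 4 groups of 38 LDS bytes with one BIS byte
    inserted between each pair of neighbouring groups."""
    if len(lds_row) != 4 * group or len(bis_bytes) != 3:
        raise ValueError("expected 152 LDS bytes and 3 BIS bytes")
    out = []
    for g in range(4):
        out.extend(lds_row[g * group:(g + 1) * group])
        if g < 3:
            out.append(bis_bytes[g])
    return out


# LDS bytes are shown by their column index, BIS bytes by a label.
row = insert_bis_columns(list(range(152)), ["B0", "B1", "B2"])
```

In the resulting row the BIS bytes sit at positions 38, 77 and 116, so that every LDS byte lies within 38 positions of a BIS byte on at least one side.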

A 3 by 496 byte array formed by the 3 BIS columns from the Physical Cluster, is called a BIS Cluster. 
The content of the BIS Cluster is formed by placing all BIS codewords of a BIS Cluster in the 24 
columns of a 24 by 62 byte array, cf. FIG. 14. 
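
The sizes quoted so far can be cross-checked against one another. The sketch below is bookkeeping only, with illustrative variable names, and uses no figure beyond those stated in the text.

```python
# Data Frames versus LDS information bytes.
frames, frame_len = 32, 2052            # 2048 User Data + 4 EDC bytes
lds_words, lds_k, lds_n = 16 * 19, 216, 248
frame_bytes = frames * frame_len
lds_info_bytes = lds_words * lds_k

# LDS codeword bytes versus the interleaved ECC Cluster array.
cluster_cols, cluster_rows = 152, 496
lds_total_bytes = lds_words * lds_n
cluster_bytes = cluster_cols * cluster_rows

# BIS codeword bytes versus the three BIS columns.
bis_words, bis_n = 24, 62
bis_total_bytes = bis_words * bis_n
bis_column_bytes = 3 * cluster_rows
```

The 32 Data Frames supply exactly the 65664 information bytes of the 304 LDS codewords, the codewords fill the 152 by 496 array with bytes D.sub.0 to D.sub.75391, and the 24 BIS codewords of 62 bytes fill the three BIS columns of 496 bytes.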

The BIS RS code is defined over the finite field GF(2.sup.8). The non-zero elements of the finite field 
GF(2.sup.8) are generated by a primitive element .alpha., that is a root of the primitive polynomial 
p(X)=X.sup.8 +X.sup.4 +X.sup.3 +X.sup.2 +1. The symbols of GF(2.sup.8) are represented by 8-bit 
bytes, using the polynomial base representation, with (.alpha..sup.7, .alpha..sup.6, . . . .alpha..sup.2, 
.alpha., 1) as a basis. The root .alpha. is represented as .alpha.=00000010. 

Each BIS codeword is represented by a vector bis=(b.sub.c,0 . . . b.sub.c,i . . . b.sub.c,29, 
Pb.sub.c,30 . . . Pb.sub.c,j . . . Pb.sub.c,61) in a Reed-Solomon code over GF(2.sup.8), with 32 parity 
bytes and 30 information bytes. Such codeword may be represented by a polynomial bis(x) of degree 
61, that may have some coefficients zero, the highest degrees representing the information part of the 
vector (b.sub.c,0 . . . etc.) and the lowest degrees the parity part of the vector (Pb.sub.c,30 . . . etc.). 
Each codeword is a multiple of the generator polynomial g(x) of the BIS codeword: ##EQU4## 

The BIS code is systematic: the 30 information bytes appear unaltered in the highest order positions 
of each codeword. The parity check matrix of code bis is such that H.sub.BIS *bis.sup.T =0 for all 
BIS codewords bis. The second row h.sub.BIS 2 of the parity check matrix H.sub.BIS is given by 
h.sub.BIS 2 =(.alpha..sup.61, .alpha..sup.60 . . . .alpha..sup.2, .alpha., 1). It corresponds to the zero 
.alpha. of the generator polynomial g(x), and defines the codeword positions to be used for error 
locations. 